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Abstract 

Let G = G(d) be a random graph with a given degree sequence d, such 
as a random r-regular graph where r > 3 is fixed and n = \G\ — > oo. We 
study the percolation phase transition on such graphs G, i.e., the emer- 
gence as p increases of a unique giant component in the random subgraph 
G[p] obtained by keeping edges independently with probability p. More 
generally, we study the emergence of a giant component in G(d) itself as 
d varies. We show that a single method can be used to prove very precise 
results below, inside and above the 'scaling window' of the phase tran- 
sition, matching many of the known results for the much simpler model 
G(n,p). This method is a natural extension of that used by Bollobas 
and the author to study G(n,p), itself based on work of Aldous and of 
Nachmias and Peres; the calculations are significantly more involved in 
the present setting. 

1 Introduction and results 

In 1997, Aldous showed that inside the 'scaling window' of the phase transition, 
i.e., when p = (1 + an _1 / 3 )/n with a — 0(1), the rescaled sizes of the largest 
components of G(n,p) converge to a certain distribution related to Brownian 
motion. His proof was based on a natural exploration process, introduced in 
the context of random graphs by Karp [16] and considered in a closely related 
context a little earlier by Martin-L6f [15] , but with a twist: after finishing 
exploring a component one starts exploring the next component in such a way 
that a certain quantity related to the exploration behaves very much like a 
random walk with independent increments. 

Recently, Nachmias and Peres [22] used the same process (with the same 
'restarts') to study G{n,p) outside the scaling window, giving a simpler proof 
of somewhat weaker forms of known results for this case; in [5] Bollobas and 
the author showed that with a little more work, much stronger results could be 
proved by analyzing the same exploration process in a new way. 
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Nachmias and Peres [23] adapted their approach to study the phase tran- 
sition in random r-regular graphs, i.e., the emergence of a giant component 
in the subgraph of a random r-regular graph obtained by selecting edges with 
probability p; they showed in particular that the 'window' is when p = (1 + 
0(n -1 ' 3 ))/(r — 1). Independently, Janson and Luczak [T3] used a different ap- 
proach to prove the supercritical part of this result in a more general context. 

In this paper we shall extend and generalize the results mentioned in the 
previous paragraph. Firstly, rather than a random subgraph of a random regular 
graph, we study the 'configuration model' of Bollobas [3J, giving a random 
(simple or multi-)graph with a given (here bounded) degree sequence; it is easy 
to see that random subgraphs of random regular graphs can be viewed in this 
way (see Fountoulakis [11]). Secondly, we prove much more precise results, 
obtaining essentially the full strength of the corresponding results for G(n,p). 
In particular, we show that above the window the size of the giant component 
is asymptotically normally distributed, corresponding to the result of Pittel and 
Wormald [35] for G(n,p), and prove results equivalent to those of Aldous [I] in 
the critical case. The approach used here is very much that of Bollobas and the 
author in [6 , adapted to the configuration model. 

Throughout we assume that we are 'near' the phase transition, in the range 
corresponding to np — > 1 in G(n,p). Perhaps surprisingly, the proofs are easier 
the closer the graphs are to critical. The method used here could probably be 
extended to the more strongly supercritical case (covered for G(n,p) in [5]), but 
further away from the phase transition other approaches (based on studying 
small components) are likely to be simpler. We assume here that the maximum 
degree remains bounded; this assumption can doubtless be weakened. 

Before turning to the details, let us comment briefly on the history of this 
problem. The existence and size of the giant component in the configuration 
model were first studied by Molloy and Reed 20, 21], who found the size of the 
largest component up to a o(n) error (not only near the phase transition, of 
course). It is easy to check that a random subgraph of a random graph gener- 
ated by the configuration model is again an instance of the configuration model. 
Hence, studying the percolation phase transition in the (random) environment 
of the configuration model reduces to studying the transition in the configura- 
tion model itself as its parameters are varied. (This is spelled out in detail by 
Fountoulakis [TT]; see also Janson [E].) Nevertheless, percolation on random 
r-regular graphs has received separate attention; for many proof techniques this 
special case is much easier to handle. In this special case, the critical point of 
the phase transition was established explicitly by Goerdt [12] , and Benjamini 
raised the question of finding the 'window' of the phase transition (see [25l[23] V 
Results establishing the approximate width (either for this special case or more 
generally for the configuration model itself) were given recently by Kang and 
Seierstad [T3] , Pittel [25] and Janson and Luczak [T3] , in all cases with logarith- 
mic gaps in the bounds. The exact width of the window, and the asymptotic size 
of the largest component in all ranges, was found by Nachmias and Peres [23] . 
but only for the case of random subgraphs of random r-regular graphs. As men- 
tioned above, here we not only extend this result to the configuration model, but 
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greatly improve the precision, establishing not only the asymptotic size of the 
largest component above and below the window, but also the scale and limiting 
distribution of its fluctuations. 

Let d = d„ = (d\ , . . . , c?l"^) be a degree sequence, i.e., a sequence of non- 
negative integers with even sum. For the moment we assume only that all de- 
grees are at most some constant d max > 2. In the following results the sequence 
of course depends on n, but often we suppress this in the notation, writing di 
for the degree of vertex i, for example. 

Let GJJ 1 be the configuration multigraph with degree sequence d, introduced 
by Bollobas pj], defined as follows. Let Si, . . . , S n be disjoint sets with |St| = d^ 
we call the elements of the S, stubs. Let V be a pairing of [J Si, i.e., a partition 
of (J Si into parts of size 2, chosen uniformly at random from all such pairings. 
Form the multigraph GJJ 1 from V by replacing each pair {s, t} with s € S, and 
£ S Sj by an edge with end vertices i and j. 

Let us write Li{G) for the number of vertices in the ith largest (when sorted 
by number of vertices) component of a graph G, noting that the definition is 
unambiguous even if there are ties in the component sizes. Our main aim is to 
study the distribution of Li(G^ ). 

When it comes to asymptotic results, we shall always make the following 
two assumptions. Firstly, there is a constant d max such that all degrees satisfy 

< rf max- (1) 

Secondly, there is a constant cq > such that 

|{M n) £{0,2}}|>c o n (2) 

for all large enough n. There are two aspects to this second condition. Firstly, 
degree-0 vertices play no role in the construction, so we could just as well rule 
them out. However, in applications we shall consider sequences containing some 
zeros, and to avoid an extra rescaling step (where n is replaced by the number 
of non-zero degrees) , it is convenient to allow them. 

The situation with degree-2 vertices is somewhat similar: apart from the 
possibility of cycle components, the multigraph G^ is a random subdivision of 
the multigraph GJ?, , where d' is obtained by deleting all degree-2 vertices from 
d. For the scaling behaviour, it is the number of (non-isolated) vertices in d' 
that matters, not the number in d. The condition ([2]) ensures that n is (up to a 
constant) the correct scaling parameter in conditions such as e 3 n — > oo below. 

Given a degree sequence d, let 

n 

[i r = fx r (d) = n^ 1 y^(rfj) r (3) 

i=l 

denote the rth factorial moment of d, where (x) r — x{x — 1) • • • (x — r + 1). Let 

A = A(d) = E ^- 1) =-- (4) 
2^ di Hi 
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As is well known (see, e.g., [201 IS), the quantity A corresponds to the average 
'branching factor' in GJf. Our main interest is the 'weakly supercritical' case 
where A — > 1 from above, but (A — l)n 1 / 3 — > oo, so we are outside the 'scaling 
window' of the phase transition. However, we shall also prove results for the 
critical and weakly subcritical cases. 

We usually write A = A(n) as 1 + e(n), and assume throughout that 

A = ^ H- 1 (5) 
Mi 

as n — > oo, i.e., that e — > 0. 

Let rj = rj(n) denote the random variable obtained by choosing an element 
of d at random, with each element chosen with probability proportional to its 
value. Thus A = E(r/ — 1) and e = E(r? — 2). The quantity 

vq = Var(?i) = Var(?7 - 2) (6) 

will play an important role in our results. An elementary calculation shows that 

v = g ■ 

m 

Under our assumptions we have E(?? — 2) — > and, from @, supP(^ = 2) < 1, 
which implies that Var(?7 — 2) = 0(1). Since (12/(^1 1, it follows that 

vo ~ ^ = 0(1) (7) 

as n — > 00. 

Writing pd = Pd{n) — : di — d}\ for the fraction of vertices with 

degree d, for A > 1, let z be the smallest solution in [0, 1] to the equation 

z = J2z d -^, (8) 

and set 

p= P (d) = i-5>v (9) 

d 

This quantity is most naturally seen as the survival probability of a certain 
branching process (see [5] ; z is essentially the probability that when we follow a 
random edge, we end up in a small component). What we call p here is exactly 
the quantity ex> appearing in the result of Molloy and Reed [3T] . Let 



^^(dHj^w-^-i + f ; (10) 

d 



it will turn out that p*n will give asymptotically the nullity (or excess) of the 
giant component. Although we shall not prove this, this is also asymptotically 
the number of vertices in the 'kernel'. 
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We shall show later (see Lemma I3"^1V) | and that 

p~M e and p*~^e\ (11) 

Molloy and Reed [21] showed, under different conditions, that L 1 (G^) = p(d)n+ 
o p (n). (Their result allowed much larger degrees, and they did not assume that 
e — » 0; indeed, their result does not 'bite' in this case.) Our main result concerns 
the fluctuations of L\(G^ ) around p(d)n in the weakly supercritical case. 

Theorem 1.1. Let d m ax > 2 and Co > be fixed. For each n let d = d n be a 

degree sequence satisfying ([1]) and ([2]). Define pi, \, p and p* as above, noting 
that these quantities depend on n. Setting e = A — 1, suppose that e — ¥ and 
e 3 n — > oo. Let L\ and N\ denote the order and nullity of the largest component 
of . Then L[ = L\ — pn and N[ — N\ — p*n are asymptotically jointly 
normally distributed with mean 0, 

Var(Li) - 2pie~ 1 n 1 Nax(N[) - 5p*n - ^§-£ 3 n, and Cov(L[,Ni) - ^-en. 

3p 3 pz 

Furthermore, 

L 2 (GS) = O p (e- 2 log(e 3 n)). (12) 

Here, as usual, given a sequence (Z n ) of random variables and a deterministic 
function f(n), Z n = O p (f(n)) means that Z n /f(n) is bounded in probability, 
i.e., for any S > there exists C such that P(|Z„| < Cf(n)) > 1 — 8 for all 
(large enough) n. We say that an event (formally a sequence of events) holds 
with high probability or whp if its probability tends to 1 as n — > oo. We write 
Z n = °p(f( n )) if Z n /f(n) converges to in probability, i.e., if for any 8 > we 
have \Z n \ < 8f(n) whp. 

We assume a bounded maximum degree for simplicity. The proof extends 
to the case where the maximum degree grows reasonably slowly; we have not 
investigated this further. 

The asymptotic correlation coefficient Cov(L' 1 , N[)/ •y/Var(L' 1 ) Var(7V{) given 
by Theorem II .11 is simply y/3/5. Although this case is not covered by our result, 
if we take the degree distribution to be Poisson as in G(n,p) then when p ~ 1/n 
we have pi ~ 1 for all i, and the variance and covariance formulae above are 
consistent with those given by Pittel and Wormald }261 Note 4] for G(n,p). 

The proof of Theorem 11.11 will show that for each d the number L\(d) of 
degree-<i vertices in the largest component satisfies 

L x {d) =n d {l- z d ) + O p {^nTe), (13) 

where = np,i is the total number of dcgree-ci vertices and z is defined by 
(|SJ|. The parameter z corresponds to £ in [14], so (IT3l) refines the results there. 
Our method would allow us to establish joint normality of these numbers with 
variances and covariances of order n/e, but we shall not give the details. 



5 



We next consider the subcritical case. Writing, as before, pd — Pd{n) for 
the proportion of vertices with degree d, let qd = dpd/ Hi = = d) be the 
corresponding size-biased distribution. Let a n be the (unique - see Section [7]) 
solution to ^2(d — 2)qde anlyd ^ = 0, and define 



V d 



q d e a - {d - 2) . (14) 



Theorem 1.2. Let d max > 2 and c$ > be fixed. For each n let d = d n 
be a degree sequence satisfying |T]) and @. Define fj,i and A as above, noting 
that these quantities depend on n. Setting e = 1 — A, suppose that e — > and 
e 3 n — > oo. Then for all x £ K we /iawe 

Li(G^)<CaogA-|loglogA + x)) -exp(- ce - a; )+ (l), (15) 

where A = e 3 n, c = c(d) = 0(1) is given by (|90[) . and <5„, defined in (jT^J) , 
satisfies 

S n~7, ~ • (16) 



In particular, 



Li(G%) = C(k>gA - 5 bg log A + P (1)), (17) 



and ii(GJ) = (2^/Mi + o p (l))e- 2 log A. 



The bound (I12| in Theorem ll.il is proved by applying Theorem 1 1.2 1 to what 
remains of the supercritical graph after deleting the largest component, so in 
Theorem ll.il we in fact obtain bounds of the type (fT5|) . (|17l) but with c, S n and 
A defined for the 'dual' distribution with ndZ d vertices of each degree d; see (|13|) 
and the remark at the start of Subsection ll.il 

Theorem 11.21 is the equivalent of (the corrected form of, see [5]) Luczak's 
extension [T7] of Bollobas's result |1] for G(n,p) in the subcritical case. 

Finally, in the critical case, we obtain an analogue of the results of Aldous [T] 
for G(n,p). The statement requires a few definitions, analogous to those in pQ. 

Let (W(s))o< s<oa be a standard Brownian motion. Given real numbers cto, 
ai and a2 with ao > 0, let 

9 

1/2 OL^S 

W ao , ai , a2 (s) = a / W{s) + a 1 s — (18) 

be a rescaled Brownian motion with drift ot\ — ot%s at time s, and set 

B(s) = B aQtaita2 (s) = W ao aua2 (s) - min W ao ,ai,aj(s')- ( 19 ) 

0<s'<s 

Also, define a process N(s) of 'marks' so that, given B(s), N(s) is a Poisson 
process with intensity j3B(s)ds, where f3 > is constant. (For the formal details, 
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see pp.) Finally, order the excursions jj of B(s), i.e., the maximum intervals on 
which B(s) is strictly positive, in decreasing order of their lengths |7j-|. Writing 
N("fj) for the number of marks in jj, this defines a joint distribution 

ihilNdj))^ (20) 

that depends on the parameters ao, ai, ct2 and /3. 

Theorem 1.3. Suppose that d = d„ satisfies assumptions (pQ) and @ above. 
Suppose also that n 1 / 3 (A— 1) converges to some a\ G M., i/iai ps/pi — > cyq, that 
P3/P1 — > ct2, and that 1/pi — > j3. Ordering the components by number of ver- 
tices, for any fixed r the sequence (ti -2 / 3 |Cj|, n(Cj))£ =1 converges in distribution 
to the first r terms of the sequence C 1 ! » defined above, where n(Cj) is 
the nullity of Cj . 

Note that in the light of ([7]) , assuming Hs/Vl — ► «o is equivalent to assuming 
v Q -> a . 

We have written Theorem 11.31 with the scaling that arises most naturally 
in the proof. From the scaling properties of Brownian motion, one can check 
that -B Qo ,ai,a 2 ( s ) i s equal in distribution (as a process) to a' B i <a ' ,1 (o^s) where 

, 2/3 -1/3 , 2/3 -1/3 j , -1/3 -1/3 „ -r 

a = a a 2 , ctQ — a 2 ct Q and a\ — ctia Q a 2 ■ Hence, if we con- 
sider only the component sizes, there is a single-parameter family of limiting 
processes, characterized by a[, the limiting value of n 1 ^ 3 (A — l)/zi/i 3 2 ^ 3 . Not- 
ing that the excursion lengths are scaled by a' , Theorem 11.31 shows that if 
n 1 / 3 (A — l)pip^ 2 ^ 3 a , then (n -2 / 3 p^ 3 p^ 1 ^i)I=i converges to the first r 
sorted excursion lengths of Bi :0li i. These excursion lengths are exactly the 
rescaled component sizes appearing in Aldous's result for G(n, l/n + on -4 / 3 ). 

If wc also consider the nullities, or mark counts, then there is a two-parameter 
family of possible limits. 

1.1 Applications and Extensions 

Let d satisfy the assumptions above (in particular, (HJ) , (|2]) , (|5]) and A = e 3 n — > 
00). Observing at all times the restriction that all degrees are at most d max , 
changing to entries of d changes the proportion pa of degree-d vertices by 
0(m/n), and hence changes the quantities pi, P2 and /13, which are all of order 
1, by 0(m/n). It follows that e = A — 1 = P2/P1 — 1 changes by an absolute 
amount that is 0(m/n), so if m = o(en) the relative change in e is 0(m/(en)). 
It is easy to see, and will follow from the results later in the paper, that the 
relative change in p, p* , A = e 3 n or S n is of this same order 0(m/(en)), as one 
would expect from the formulae (fTTj) and (TT1)1) . 

When to = o(y /l n/e), the changes in pn and p*n are small compared to the 
relevant standard deviations, and it follows that Theorem 11.11 applies just as 
well to Gj, for any d^ obtained from d„ by changing o(yJn/e) entries, even 
when all quantities in the conclusion of the theorem are calculated for d rather 
than for d'. Similarly, if m = 0(y/ n/s) then the relative change in 5 n is 
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0(1/VA) = o(l/ log A), and it is not hard to check that Theorem 11.21 applies 
to GJJ 1 , in this case; for Theorem 11.31 the corresponding condition is simply 
m = o(n 2 / 3 ), so that the limit of n 1 / 3 (A— 1) is unchanged. Note that m = 0{y/n) 
satisfies the bound on m in all three cases. 

One application of these observations concerns random subgraphs of random 
r-regular graphs, or indeed random subgraphs of configuration (multi-)graphs. 

Let d be the (random) degree sequence of the graph G™ [p] obtained by start- 
ing with an r-regular configuration multigraph, selecting edges independently 
with probability p, and retaining the selected edges. Conditional on d, the dis- 
tribution of G™[p] is simply that of Gd- Hence, as noted by Fountoulakis [TTj . 
one can study G™[p] by studying Gg 1 . 

It is very easy to check that for < d < r, the proportion pd of degree-d 
vertices in d satisfies 

Pd = p d + O p (n- 1 /2 ) where p o = Qp d (l - pf- d . 

It follows that the quantities Hi defined above satisfy 

Hi = $ + O p {n- 1/2 ) where Hi = P l ( r )i = P lr ( r - 1) • • • (r - i + 1). 

Note that A = (4/(1° = p(r - 1), and when A 1, i.e., p ~ l/(r - 1), then 
A 1 ? ~ r/(r — 1) and H3 ~ r(r — 2)/(r — l) 2 . From the remarks above, even 
though the actual number of vertices of each degree is random, Gg will satisfy 
the conclusions of Theorem I1.1H1.3I for the idealized sequence with pd replaced 
by p d . Hence, defining p° by |8]) and ([9]) with pd replaced by p d , Theorems 1 1.11 - 
ll.3l have the following consequence. (We omit the nullity result and the analogue 
of (|15[) for simplicity; these also carry over.) 

Corollary 1.4. Let r > 3 be fixed and letp = (l+e)/(r— 1) where e — e(n) — > 0. 
Let G be the random subgraph of the random r-regular configuration multigraph 
on n vertices obtained by selecting edges independently with probability p. 
If e > and e 3 n — > 00 then 

Ll(G) : A 4jy(o,i), 

where p° (defined above) satisfies p° ~ 2er/(r — 2), and a 2 = 2e~ 1 r/(r — 1). 
If e < and \s\ 3 n — > 00 then 

L 2 (G) = a- 1 (log A - I log log A + O p (l)), 

where A = |e| 3 n and S n , defined by p4p with p° d in place of pd, satisfies S n ~ 

r-l „2 
2(r-2) fc ' 

Finally, if n^'^e — > ot\ G K then the sizes and nullities of the components 
sorted in decreasing order of size converge in distribution to the distribution 
described in (|20p with ao = (r — 2)/(r— 1), 012 — (r — 2)/r and f3 — (r — l)/r.D 
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Note that this result is consistent with, but much sharper than, the results 
of Nachmias and Peres [23 . Of course, one can formulate a similar corollary 
concerning the random subgraph G£{p] of GJJ 1 , for any degree sequence d sat- 
isfying (p} and ©; passing to the subgraph multiplies fii by p l , just as in the 
r-regular case, and p must be chosen so that the value of the 'branching factor' 
A in the subgraph (p times that in the original) is of the form 1 + e with e-)0. 

As shown by Bollobas [3] (see also [2]), when the maximum degree is bounded, 
the probability that the configuration multigraph G^ is simple is bounded away 
from 0, and conditional on this event, has the distribution of Gd, a uni- 
formly random simple graph with degree sequence d. It follows that any 'whp' 
results for GJj 1 transfer to Gd- This applies to Theorem II .21 in the weaker form 
(fT7|) , but not to the other results above. More precisely, as shown in [3] (or, 
after translating from the enumerative to probabilistic viewpoint, [2]), when 
the maximum degree is bounded, the probability p that GJJ 1 is simple satisfies 
p ~ exp(— 8 — 9 2 ), where 9 — J2i (2) / J2i which in our notation is /i2/(2/*i). 

The proofs of Theorems I1.1H1.3I involve 'exploring' part of the graph. We 
can end these explorations when certain entire components have been revealed, 
comprising in total o(n) vertices. It is easy to check that the probability of 
encountering a loop or multiple edge in the exploration is o(l). Moreover, the 
unexplored part of the graph may be seen as a configuration multigraph G. 
Since only o(n) vertices have been explored, the (conditional) probability that G 
is simple is p+o(l): the corresponding 9 is within o(l) of the original 6. It follows 
that if £ is some not too unlikely event defined in terms of our exploration, then 
the probability that £ holds and G% is simple is {¥(£) +o(l))(p + o(l)) ~ p¥(£). 
Thus conditioning on G^ being simple hardly changes the probability of £ . 
Using this observation it is easy to transfer the results above to random simple 
graphs; we omit the details. 

Theorem 1.5. All the results above apply unchanged if Gjj is replaced by the 
random simple graph Gd- Q 

Note that in the analogue of Corollary 11.41 this means that we consider a 
random subgraph of a random r-regular simple graph. Here one must be slightly 
careful with the argument: we need to explore the subgraph, but then check 
whether the original graph is simple. 

The rest of the paper is organized as follows. In the next section we define 
the exploration process that we study, and two corresponding random walks 
(X t ) and (Y t ). In Section |3] we establish some key properties of (Xt), includ- 
ing the 'idealized trajectory' that we expect it to remain close to. Using these 
properties, and assuming Theorem 11.21 for the moment, we prove Theorem 11.31 
in Section @] and Theorem 1 1.1 1 in Section [5] In Section [5] we prove a local limit 
theorem (Lemma 16. 3|) for certain sums of independent random variables. Fi- 
nally, in Section [7] we prove Theorem 1 1.2 1 The proof (which uses the local limit 
theorem) is rather different from that of the other main results: we use domi- 
nation arguments to study the initial behaviour of (Xt), rather than following 
its evolution as in the main part of the paper. This is the reason for postponing 
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the proof, even though (a weak form of) the result is needed in the proofs of 
Theorems 11.11 and 11.31 to rule out 'other' large components. 

2 The exploration process 

Consider the following exploration process for uncovering the components of 
G = G™. This is a slight variant of the standard process, in that we check for 
'back-edges' forming cycles as we go. A form of this variant was used by Ding, 
Kim, Lubetzky and Peres [9] in their study of the diameter of G(n,p); we could 
in fact use a more standard exploration here, but the variant results in slightly 
cleaner calculations. 

We shall define an exploration so that after t steps of the process, t vertices 
have been 'reached'; the other n — t are 'unreached'. Furthermore, a certain 
random number of stubs will have been paired with each other, and each un- 
paired stub will be either 'active' or 'unreached'; the active stubs are attached 
to reached vertices, the unreached ones to unreached vertices. We write A t for 
the number of active stubs, and Ut for the number of unreached stubs. The 
process we define will be such that in the complete pairing V, the active stubs 
are paired to a subset of the unreached stubs, and the remaining unreached 
stubs are paired with each other. Moreover, the conditional distribution of the 
pairing V given the first t steps of the process is such that all pairings of the 
active and unreached stubs satisfying this condition are equally likely. 

At step t + 1 of the process, if A t > then we pick an active stub a t +i, for 
example the first in some order fixed in advance. Then we reveal its partner 
Ut+i in the random pairing V, which is necessarily unreached. Let Vt+i be the 
corresponding unreached vertex. The stubs at+i and Ut+i are now paired (so in 
particular at+i is no longer active), and the remaining stubs Ut+j.,1, . . . , ut+i, rt+1 
attached to v t +i are provisionally declared active. But now 

(i) we check whether any other (previously) active stubs are paired to any 
of the u t +i.i, and then 

(ii) we check whether any of the remaining u t +i.i are paired to each other. 
We declare any pairs found in (i) or (ii) 'paired', and continue. These pairs 
correspond to 'back-edges'. 

If At = then we simply pick Vt+\ to be a random unreached vertex, chosen 
with probability proportional to degree, provisionally declare all its stubs active, 
and then perform the second check (ii) above. In this case we say that we 'start 
a new component' at step t + 1. 

For (partial) compatibility with the notation in [5], let rjt+i denote the degree 
of v t +i- We write t +i for the number of back-edges found during step t + 1. 
and Y t = J2i<t ^ f° r the total number of back-edges found during the first t 
steps. 

Let Ct be the number of components that we have started exploring within 
the first t steps, and set 

X t =A t - 2C t . (21) 
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Considering separately the cases A t > and A t = 0, and noting that finding a 
back-edge pairs off two stubs, we see that 

X t+1 -X t = Th + i-2-20 t+ i, (22) 

while by the definition of Y t , 

Y t+1 -Y t = 6 t+1 . (23) 

Let Ft denote the (finite, of course) sigma-field generated by the information 
revealed by step t. Let Ud,t denote the number of unreached vertices of degree 
d after t steps, so, recalling that U t denotes the (total) number of unreached 
stubs, we have 

U t = Y, dU d,t- (24) 

d 

In both cases above, the vertex Vt+i is chosen from the unreached vertices and, 
given Ft, the probability that any given vertex is chosen is proportional to its 
degree. Hence 

P( Vt +i = d\F t ) = dU djt /U t . (25) 

In particular, 

E(r) t+1 - 1 | F t ) = Ui 1 d ( d ~ l ) u d,t- 

d 

In the analysis that follows, we shall impose the assumption 

t < c n/2, (26) 

where cq is the constant in Since at least cqti vertices have degree at least 1, 
(|2"6"|) implies that 

U t > c n/2, (27) 

and in particular that Ut = Q(n). This will simplify some formulae in the 
calculations. 

Suppose that A t > 0. Then, given Ft and i>t+i, the expected number of 
pairs discovered during check (i) above is exactly 

(At - = (Vt+i l)A t /U t + 0(l/n), 

using Ut — O(n) for the approximation. Indeed, each of the A t — 1 other stubs 
that were active before this step is equally likely to be paired to any of the Ut — 1 
remaining unreached stubs. We could write an exact formula for the expected 
number of pairs found during check (ii) , but there is no need: instead we simply 
note that the expectation is 0(l/n). It follows that 

E(0t+i | Ft, m+i) = (nt+i - l)A t /U t + 0(l/n), (28) 

and hence that 

E(0 t +i | Ft) = E( Vt+1 - 1 | Ft)A t /U t + 0(l/n). (29) 
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The last two formulae are also valid when A t = 0: this time there is no check 

(i) , and the expected number of back-edges found during check (ii) is 0(l/n). 
From (glD and flU) it follows that 

E(X t+1 - X t | F t ) = -1 + E(rh +1 - 1 | F t )(l - 2X t /U t ) + 0(C t /n). (30) 

We use Ct > 1 for t > 1 to avoid writing a separate 0(l/n) error term, not that 
it matters. 

Simply knowing the expected changes at each step is good enough to allow 
us to deduce fairly tight bounds on the size of the giant component. But for 
asymptotic normality we need a bound on the variance. We could give a formula 
that is useful when A t is comparable with Ut, but we shall not need this. Instead, 
we simply note that the number 9t+i of pairs found during our checks (i) and 

(ii) is bounded, and for t = o(n), which implies A t = o(Ut), we have 9t+i = 
whp. Since r]t+i is bounded, it follows easily that for t = o(n) we have 

Var(X t+1 - X t | Ft) = Varfo+i | F t ) + o(l) = v Q + o(l), 

where vq is defined in (IBJ, and the second equality follows from the fact that 
only o(n) vertices have been 'used up' by time t. 

Since A t > 0, with equality only when we have just finished exploring a 
component, the times t\, . . . , i c (G) a ^ which we finish exploring components are 
given by 

U = min{£ : X t = -2i}. (31) 

Since exactly one vertex is revealed at each stage, if Ci denotes the ith component 
explored, then 

\Ci I t% ti— i , 

where we set io = 0. 

Recall that Y t denotes the number of back-edges found within the first t 
steps. Then Y ti — Y ti l is simply the nullity of Cj: 

n(C i )=Y u -Y u _ 1 . (32) 

In the rest of the paper we shall study the behaviour of the random walks 
{X t ) and and use this to prove our main results. 

3 Trajectory and deviations 

As in [B], we shall write the difference X t +i — X t as D t + A t , where D t — 
E(X t +i — X t | Ft), so the A t may be regarded as a sequence of martingale 
differences. It is more or less automatic that X)i<t A» is asymptotically normally 
distributed, so we need to understand the sum of the D t . Each D t depends on 
X t , but it turns out that the dependence is not very strong. So if we can find an 
'ideal' trajectory (corresponding to all A t being equal to zero), then bounding 
the deviations of (Xt) from this trajectory will not be too difficult. 

The first problem is that the term Ut appearing in (|25|) is not so simple. We 
start with some lemmas. The first is a standard result about order statistics. 
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Lemma 3.1. Let Z\, . . . , Z n be i.i.d. samples from a distribution Z on [0,1] 
with distribution function G(x) = F(Z < x), and let N n (x) = \{i : Zi < x}\. 
Then for any (deterministic) function y = y(n) we have 



sup \N n (x)-nG{x)\=O p (y/nG(y)). (33) 

0<x<y 

Proof. Let A have a Poisson distribution with mean n, and given A, let Z[, . . . , Z' A 
be i.i.d. with distribution Z, so the set {Z^} forms a Poisson process on [0, 1]. (If 
Z has a density function g(x) = G'(x), then the intensity measure of the Poisson 
process is ng(x)dx.) Writing N'(x) for the number of Z[ in [Q,x], consider the 
random function F(x) — N'(x) — nG(x). From basic properties of Poisson pro- 
cesses, this function is a continuous-time martingale on [0, 1] with independent 
increments. Hence Doob's maximal inequality jlO, Ch. Ill, Theorem 2.1] gives 

. . . . . Var(Ffy)) nG(y) 
P(sup \F(x)\ >t)< = 

since Var(F(j/)) = Var(iV'(y)), and up to an additive constant, N'(y) is simply 
Poisson with mean nG(y). It follows that 



S up\F(x)\=O p (y/nG^y)). (34) 

x<y 

The expected value of \A — n\ is 0(y/n). When A > n, delete a random 
subset of the points {^'} of size A — n. When A < n, add n — A i.i.d. new 
points to {Z[} with distribution Z. In this way we obtain a set of n i.i.d. 
samples from Z. Given A, the added/deleted points have distribution Z, so the 
expected number in [0,y] is \ A — n\G(y). Hence the unconditional expectation 
of the number of points added or deleted in [0,y] is 0(y/nG(y)) — 0(y / nG(y)). 
Hence 131 follows from flM). □ 



In our exploration process, the next vertex vt+i is always chosen with prob- 
ability proportional to its degree. Hence the (distribution of) the random se- 
quence a = (vi, . . . ,v n ) has the following alternative description: first assign 
a random order to all stubs. Then sort the vertices so that v comes before w 
if and only if v's earliest stub comes before w's earliest stub. In turn, we may 
realize the random order on the stubs by assigning i.i.d. U[0, 1] variables to the 
stubs; we shall call these variables stub values. 

For each t, there is a random 'cut-off' Z t € [0, 1] so that a vertex v is among 
if and only if its smallest stub value is at most 1 — Zt- Fixing a cut-off 
value z, the expected number of vertices with all stub values at least 1 — z is 
exactly ^2 d n,iz d , so we expect to have 

n - t ~ Yl n d Zf. 

d 

Let f(z) = f n {z) denote the probability generating function of the degree 
distribution d, so 

d=l 
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Recall that Ud.i denotes the number of unreached degree-d vertices after i steps, 
i.e., the number of degree-d vertices not among the first i elements of our random 
order on the vertices. 

Theorem 3.2. Let d = d„ be any degree sequence of length n with all degrees 
between 1 and some constant d max , and let (vi,... ,v n ) be the random order on 
the vertices defined above. Define a function t m- z(t) from [0, 1] to [0, 1] by 
/(z(t)) + t = 1, where f is the probability generating function of d. Then for 
any t = t(n) > 1 we have 

max max \Ua i — tiazU I 'n) d \ = Or,(Vt), 

l<d<d max l<i<t 

where n d is the number of degree-d vertices in d. 

Proof. Construct the sequence Vi, . . . ,v n from stub values as above. Let 
denote the distribution obtained by taking the minimum of d independent U[0, 1] 
random variables. Note that has distribution function Gd{x) = 1 — (1 — x) d . 

For each d, let Zd,i denote the minimum stub value of the ith degree-d 
vertex. Then Zd,i, • • • , Zd,n d are i.i.d. with distribution Z^ d \ Let mg(x) denote 
the number of degree-d vertices whose smallest stub value is at most x. Then 
by Lemma |3.1[ for any (deterministic) y — y(n) and any d we have 

Ed.y = sup \m d (x) - n d Gd(x)\ = Op(v 'n d G 'd{y)) ■ 

0<x<y 

Summing over d, and using Y2%=i V®i — V^VS we nave 

d 

where 

G(x) = V —G d (x) = 1 - /(l - x). (36) 
ti n 

Let m(x) = ^2i d fn d {x). Then by the triangle inequality we have 

sup \m(x) - nG{x)\ < E y . (37) 

0<x<y 

Define y by nG(y) — t, so E y = O v {^ft). For 1 < i < t set x% = \ — ziijn). 
From ((36]) we have G(xi) = 1 — f(z(i/n)), so by the definition of the function 
z, we have G{xi) — i/n, i.e., nG(xi) = i. Note that x\ < x-i < ■ ■ ■ < xt = y. 

From (|37p . for every i < twe have \m(xi)— i\ < E y . Furthermore, the number 
of degree-ci vertices among the first m(xi) vertices, namely m d (xi), differs from 
n d G d (xi) by at most Ed, y < E y . It follows that the number of degree-c? vertices 
among the first i vertices differs from ndGd{xi) by at most 2E yi i.e., 

\{n d - U d ,i) - n d G d {xi) \ < 2E y . 

The difference on the left is exactly \Ud.i — ndz(i/n) d \, so the result follows. □ 
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Returning to our process, recall that the vertices Vi,V2, ... are chosen ac- 
cording to the random distribution considered above. Recall also that when 
(j2"o) holds, then U t = 6(n). 

Corollary 3.3. Define z(t) by f(z(r)) + t = 1. Then for any t < con/2, 
writing z for z(i/n), we have 



sup 

i<t 



U, 



O p {Vt/n) 



and 



sup 



/'(*) 



= O p (Vt/n). 



(38) 
(39) 



Furthermore, if t = o(n), then 



sup |Var(?7 i+1 - 1 | J 7 ,) - v \ = 0(t/n) = o(l), 

i<t 

where vq is defined by ©• 

Proof. By Theorem 13.21 there is some random K t such that K t = O p (y/i) and, 
for all i < t and all d < d max , 



(40) 



\U dii -n d z(i/n) d \ < K t . 

Fix i and write z for z(i/n). Noting that zf'(z) = n^ 1 ^ d dn d z d : (|24|) and (|40 
give 



\Ui/n-zf\z)\ = - 



dUd ^ -^dn d z d 



d=l 



Since K t — O p (y/t), this proves (|38)l . 

For (j39]h note that P(^+i = d | J^) is by definition equal to dU d ^/U~i (see 
pgjl). so when PQj holds, P(r? i+ i = d | J 7 ^) is within 0(K t /n) of 



£?d(z) = 



dllrlZ 



The bound (|39| follows by noting that 

^ _ Zd(d-l)n d z d _ z\f"{z) _ zf"(z) 

Y )qd{ ) ~ E d dn d z« - zf'(z) - f(z) ■ 

The variance bound is immediate from the fact that by time t we have 'used 
up' 0{t) vertices of each degree, changing the conditional variance by at most 
0(t/n). □ 

Let us now define the idealized trajectory that we have in mind. Recall that 
f(z) = f n (z) is defined by (|35|) . and z — z(t) by 



f(z)+T=l 



(41) 
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for < t < 1, so 

- = -— . (42) 
dr f'(z) { ) 

We shall think of t as a rescaled time parameter, taking r = t/n, but will write 
our trajectory as a function of z. In the light of our assumption (|26[) . we need 
only consider r < cq/2. It is easy to check that this implies z > 1/2. In fact, 
we can always assume that r = o(l) and so z — 1 — o(l). 

Let /ii = /'(l) = n~ l J2^ n d be the average degree in our graph G — GJ, 
and define functions x(t) and u(r) by 

x = zf'(z) (43) 
Mi 

and 

u = zf'(z). (44) 

Using elementary calculus, it is straightforward to check that these functions 
satisfy x = when r = 0, i.e., when z = 1, and 

dx dx dz _ zf"(z) ( 2x\ , „„. 

d; = didr = - 1+ w(^-J- (45) 

Recall that / may depend on n. Since at least cqu vertices have degree 
between 1 and c? max , for z > 1/2 (which is the only range we consider), we have 
f'(z) > c 2 _dmax , so f'(z) is bounded below away from zero. Also, any given 
derivative of f(z) is bounded by a constant depending only on <i max . Using (j42]) 
it follows easily that the derivative of any fixed order of x with respect to r is 
bounded uniformly in (large enough) n. 

One can check that 

= 2(/') Y" + Mi</") 2 - Mi/7" - Ml*/'/'" 

Substituting 2 = 1 and noting that — f^ l '{z)\ z —\, we have 
d 2 



+ /ii/i 2 - - MiM3 = Mi(miM3 + M2 ~ MiM2)- 



By assumption A = ^2/^1 ~ 1, an d as noted in Section^ ^3 = 9(1) (see ([7])). 
Hence 

<] "' ~-j3 = -e(i). (46) 

r=o Mi 



dT 2 ^ 



Let us collect together some basic properties of the trajectory x. We write 
x for the derivative of x with respect to r. 

Lemma 3.4. Suppose that d satisfies the assumptions ([1]) and ([2]), and t/iat 
A = A(d) ->■ 1. T/ierc 

(i) 1(0) = 0,. 
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(ii) i(0) = A — 1 and 

(iii) x(t) = + o(l), uniformly in t — o(l). 

Suppose in addition that e = A — 1 > and that e 3 n —> 00, and let p = p n be 
defined by (J9]). Then also 

(iv) x{p) = i), 

(vi) x(p) r~j —e and 

(vii) x(t) > et/2 whenever < r = o(e). 

Proof. We have noted (i) already. Substituting z = 1 (corresponding to r = 0) 
into (|45|) gives (ii). (iii) follows from (|46|) and the fact (noted above) that the 
third derivative of x is uniformly bounded over r = o(l). 

Turning specifically to the supercritical case, (iv) follows easily from (|43ll and 
(|9|). Indeed, recalling that / is the generating function of our degree distribution, 

says exactly that z is the smallest positive solution to z = f'{z)j p,. From 
(|4l?l) we have x = at this value of z. Now © says that p = 1 — /(z) which, by 
our change of variable formula (|41l) . is exactly the corresponding value of r. 

(v)-(vii) follow from (i)-(iv) and Taylor's Theorem. □ 

For 1 < t < con/2 set xt = nx(t/n) and ut = nu(t/n); our aim is to show 
that X t will be close to xt and /7* close to u t ; the functions a; and u are the 
corresponding 'scaling limits'. Since, as a function of r, x has uniformly bounded 
second derivative, we have 



da; 

x t+ i - x t = — 
dr 



+ 0(l/n). 

=t/n 



Hence, from (|45[) , writing z for z(t/n), and defining w t = ^jTT^y > w e have 

SHi-H = -l + i«t^l-^ +0(l/n), (47) 

which is strongly reminiscent of (1301) . Our aim is to show that (Xt) will whp re- 
main close to (xt), and use this, and the asymptotic normality of the deviations, 
to prove Theorem ll.il 

For the rest of the section we fix some i max = tmax(^) = °( n ) ( see the next 
two sections for specific values). In what follows, we shall only consider values 
of t up to < max . 

Set 

D t = E{X t+1 - X t I F t ), 
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noting that D t is random. Corollary 13.31 shows that for any deterministic t — 
t(n) with t < t max , there is some random K t satisfying 

K t = O p (Vt/n) (48) 

such that \Ui — Ui\/n < K t and \&(rj i+ i — 1 | Ft) — wi\ < K t for all i < t. 
Recalling that Ui > cqu/2 in the range we consider, and noting that 77^+1 is 
bounded by <i max , it follows using (f30]) and (|47|) that for i < t < i max we have 



I A - (si+i -Xi)\<c( {Xi Xil +Kt + ^), (49) 
\ n n J 

for some constant c that depends only on d max and Cq. Since Xo = 0, we may 
write X t as 

X t = ^(Di + A,) = A + St, 

i<t i<t 



where 



Let 



St =E A - 



i<t 



X t = xt + S t , (50) 

which we shall think of as a (rather precise) random approximation to X t , and 
dehne the 'error term' E t by 

E t =X t -X t = ^D i -x i . (51) 

i<t 

Recall that xt is deterministic. The key point is that the distribution of St 
is easy to control, since (St) is a martingale with bounded differences. Let 

M t = max ISA. 

0<i<t 

Lemma 3.5. For any (deterministic) t = t(n) and any m > we have 

P(Mt > m) < d 2 ma J/m 2 . (52) 

In particular, M t — O v (s/t). Furthermore, for any a = a(n) > and any 
u> = cu(n) — > 00, the event {Mt < at/4 for all w/a 2 < t < t max } holds whp. 

Proof. Since the differences Aj are bounded by d max , their (conditional) vari- 
ances are at most d 2 ^^, so Vax(St) < d^f. Applying Doob's maximal inequal- 
ity gives (j52")l . That M t = O p (y/t) follows immediately. 

For the last part, let ti = 2 i oj/ct 2 , and let Ei be the event that M ti+1 > crfj/4. 
Then ([52) gives 

S aH 2 /16 S 2>u • 
It follows that J^P(£j) = o(l), so whp no £j holds, giving the result. □ 
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Lemma 3.6. Let t = t(n) satisfy t — > oo and t = o(n). Then St is asymptoti- 
cally normal with mean and variance Dot, where Vq is given by ([6]). 

Proof. Recall that (St) is a martingale with So = 0, and the differences Aj are 
bounded. Moreover, by Corollary 13.31 Var(A i+ i | J 7 ,) ~ v when i = o(n). The 
result thus follows from a standard martingale central limit theorem such as 
Brown [7J Theorem 2]. □ 

We now turn to the error terms E t . Using the second expression for E t in 
(j51"j) . summing (|4"9")l over 1 < i < t, and noting that |Xj — Xi\ = \Si + Ei\, for 
t < imax we see that 

(t tC 
- max \Xi - Xl \+tK t + — 
n i<t n 

ct . . ftM t tCt-i 
< - max \EA + c -+tK t 1 



n i<t \ n n 

Since ct/n < ct max /n = o(l) is less than 1/2 if n is large, then for n large enough 
(which we assume from now on), it follows by induction on i that 

\E i \<2c( t -^+tK t + t -^±) (53) 
\ n n J 

for < i < t. 

Set 

M* t = 2c + tKt^j . (54) 

Note that M t * is increasing in Also, for < t < t max , (|53|) (applied with i = t) 
gives 

|JSt| < M* + 2ctC t _i/n. (55) 
For any (deterministic) i = t(n), it follows from (|48p and Lemma T3. 5 1 that 

M t * = O p (i 3 / 2 /n). (56) 



4 The critical case 

Using the bounds from the previous section, it is very easy to prove Theorem ll.3l 
The hardest part is establishing that the description of the limit actually makes 
sense; this follows from the results of Aldous [I] by simple rescaling. Since all 
probabilistic technicalities are same as in [T], we shall not mention them. Indeed, 
we take a combinatorial point of view in the estimates that follow. 

Proof of Theorem ll.'A Recall that by assumption rt 1 / 3 (A — 1) — > cti € K, while 
/W^i — * a o and A*3/jUi —> cti with ao, a.2 > 0. For the moment, let u be a large 
constant. A little later we shall allow u> to tend to infinity slowly. 

Define a random function S(s) on [0,w] by setting S(t/n 2 ^ 3 ) — n~ x ^St for 
< t < ujn 2 / 3 : and interpolating linearly between these values. Recall that St 
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is a martingale with bounded differences and that for t < wn 2 ' 3 = o(n), the 
conditional variances of the differences are Vo + o(l) ~ ^3/^1 ~ «o (see ©). 

1 /2 

It follows easily that S converges to a W where Wis a standard Brownian 

motion on [0,w], in the sense that these random functions can be coupled so 

1/2 

that sup sg [o,w] \S( S ) ~ a o ^( s )l converges to in probability. (To see this, one 
can apply a functional martingale central limit theorem, or simply subdivide 
into suitable short intervals and use a standard martingale CLT.) 

Recalling that X t = xt + St — nx(t/n) + St, define a random function X(s) 
on [0, w] by setting 

X(s) = n~ x ' 3 X t = n 2 / a a;(s»~ 1/a ) + S(s) 

whenever s = t/n 2 / 3 for integer t < urn 2 / 3 , and again interpolating linearly. 
Note that X is given by adding a deterministic function to S. From Lemma l3.4l 
and Taylor's theorem, we have 

n^x^rT 1 ' 3 ) = O + n^iX-^sn-^-n 2 / 3 + o(l) 

= OCXS - — S + 0(1), 

uniformly in s € [0,w]. It follows that X converges to the inhomogeneous 
Brownian motion W a0iai ,ct2 defined in (jl8[) . 

Setting t max = ujn 2 / 3 , by (|56|) the quantity M 4 * defined in (|54|) satisfies 

Mt w = O p (t 3 ^ 2 Jn) = O p (l) - op^ 1 / 3 ). 

Pick some (deterministic) /c = fe(n) with k/n 1 / 3 — > 00. Suppose that we explore 
more than k components in the first t max steps. When we finish exploring the 
fcth component we have X t — —2k and C't = k. Since t ma x = o(n), the bound 
(|55|) thus gives \E t \ < O p (l) + o(k). In particular, whp \E t \ < k. It then follows 
that \X t \ > \X t \ — \E t \ > k. From the convergence of (X t ) to W ao , ai . a2 we 
have sup t < tmax \X t \ = O p (n 1 / 3 ). It follows that C tmax = O p (n 1 / 3 ). Using l[55)l 
again, this gives sup t<t \E t \ — O p (l). In other words, the 'idealized random 

trajectory' X t is an extremely close approximation to X t up to time t max . Using 
(Xt) to defining a function X on [0,w] as for S and X above, it follows that X 
also converges to W ao ,ai,a 2 - 

Finally, recalling that X t = At — 2Ct, and that X t first hits —2k when we 
finish exploring the fcth component, i.e., just before Ct increases to k + 1, it is 
easy to check that for t > we have A t = X t — mini <t X% + 0(1). Defining a 
final function A on [0,w] using the A t , we see that A converges to the function 
B defined in P^|) . So far u> was fixed, but convergence for all fixed cj implies 
convergence for oj — > co sufficiently slowly. Now the sizes of the components 
explored during the first urn 2 / 3 steps are simply r?l 3 times the excursion lengths 
of A, which converge to the excursion lengths of B. (This follows from basic 
properties of B.) 
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For the component sizes, it remains only to show that when us — > oo, for 
any S > 0, whp there are no components of size at least 5n 2 / 3 in the rest of the 
graph. This follows from Theorem 11.21 by an argument similar to that at the 
end of Section O 

Finally, we also claimed convergence for the nullities, or numbers of back- 
edges, to appropriate Poisson parameters. For t = o(n), which implies Ut ~ 
and Efo+i - 1 | T t ) ~ E(ry - 1) = A ~ 1, from ([29]) we have 

E(0 t+1 | T t ) - (1 + o(l))— + 0(l/n). 

Mi" 

In terms of the rescaled function A on [0, w], this corresponds to formation 
of back-edges at rate A(s)/[Xi, and joint convergence of the component sizes 
and back-edge counts to the excursion lengths and mark counts claimed in the 
theorem follows easily as in [T]. □ 



5 The supercritical case 

We follow the argument in 6. closely, and attempt to use the same notation 
where possible. Throughout this section we fix a function u> = ui(n) tending to 
infinity slowly, in particular with a; 6 = o(e 3 n) and lu 2 — o(l/e). As in [6], set 

(To = \fen 

and 

ignoring, as usual, the irrelevant rounding to integers. Note for later that to = 
o(en), and that to > tu/e 2 if n is large. We shall apply the results of Section [3] 
with, say 

W = 4— en ~ 2pn = 8(en); (57) 

see Lemma 13. ^[v)[ 

Lemma 5.1. Let Z denote the number of components completely explored by 
time to, and let T — ini{t : X t = —2Z} be the time at which we finish exploring 
the last such component. Then Z < uj/s < ct /uj and T Q < uj/e 2 < ao/(eui) hold 
whp. 

Proof. Set k = ui/e and t' = ui/e 2 . It is easy to check that k < oojio and 
t'o < fo/(ew), so it suffices to prove that Z < k and To < t' hold whp. 

Note first that t = ^(n/e) 1 / 2 , so 4 /2 /n = cj 3 / 2 (e 3 n)-^ 4 = o{uj 3 ' 2 ) = o(k). 
Let A\ denote the event that MT < fc/8, so A\ holds whp by (|5T)]) . Let A% be 
the event that M t > < k/8, and ^3 the event that M t < et/4 for all t' < t < t n . 
Then, noting that k/^/tf^ = \fuj — > 00, the events A2 and A3 hold whp by 
Lemma 13.51 
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From Lemma [3.41 we have x% — nx(t/n) > et/2 > for all t < to = o(en). 
Suppose that A = A\ fl A2 n A3 holds. Then we have x t + S t > x t — M t > 
et/A — /c/8 for all t < t , using A2 for t < t' , and A3 for t > t' . 

For t < to, from and the fact that to = o{n) we have 

|£ t | < M* + 2ciC t /n < fc/8 + C t 

if „4i holds and n is large. 

At time T we have Xt = — 2Z (see ((3T|) ) and Ct — Z. Suppose that A 
holds. Then X To = X To -E t < -Z+k/8. Since X To = x To +S To > eT /4-fc/8, 
it follows that 

-Z + fc/8 > X To > eT /4 - fc/8. 

Rearranging gives Z < fc/4 — eTo/4 < k/A and hence, since Z > 0, To < k/e, 
completing the proof. □ 

Let Ti = inf{£ : X t — — 2(Z + 1)}, so T\ is the first time after to at which we 
finish exploring a component. In particular, there is a component with T\ — To 
vertices. 

Lemma 5.2. Ti — To is asymptotically normally distributed with mean pn and 
variance 2p,\£~ 1 n, where p is defined by 

Proof. For t < T\ we have Ct < Z + 1, which is whp at most w/e + 1 by 
Lemma [5Tl Fort < min{Ti, £ max } it follows that tC t /n = 0{uj). Since tUax/n = 
6(e 3 / 2 n 1 / 2 ), the bounds (f55|) and (f56|) imply that whp 

max lEA < uVehi < a /Vu, (58) 

say. 

Ignoring the irrelevant rounding to integers, let t\ — pn. Let t^ = t\ — to 
and tf = t\ + 1 . Recalling ([ST)) , we have tf < i max = O(en) = 0(cr 2 ). Hence, 
by Lemma I3~5l M t + = O v (ao)- Since X t — Xt + St + E t it follows that 

max \X t — Xt\ < V^J&o (59) 

t<min{Ti,t+} 

holds whp. 

Let a = —x(p), so from Lemma 13.41 

a = —x(p) ~ e. (60) 

Since x(p) = and x is uniformly bounded, recalling that to — o(en) it 
follows easily that x t - and x t + are both of order eto = toao- To be concrete, if 
n is large enough, then we certainly have 

x t - > lQJujoo and x t + < — 10^/cuao. 

say. By Lemma [3.41 we have Xt > eto/2 > lOy/ujao- Also Xt increases near 
(within o(en) of) t = 0, decreases near t = ti, and is of order 0(e 2 n) in between. 
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It follows that hif to < t < t - %t > 10-y/uJcro. Let B denote the event described in 
(|59|) . Then, whenever B holds, we have X t > for to < t < min{Ti,i7}. Since 
X Tl < -2(Z + 1) < 0, and T x > t by definition, this implies T x > . 

Recall from Lemma 15.11 that Z < er whp. Suppose Z < ctq, £> holds, and 
Ti > tf . Then from B and the bound on x t + wehave X t + < — 9-y/^oo < —2Z—2, 

contradicting T\ > tf. It follows that T\ < tf holds whp. 
We claim that 

sup \Xt-X tl -a{h-t)\ = o p (ao)- (61) 

\t-ti\<t 

From (|50|) we may write X t — X tl as 

(x t -x tl ) + (S t - S tl ). 

Recalling that t\ = pn, x(p) — —a and that x is uniformly bounded, the differ- 
ence between the first term and a(t\ — t) is 0(\t — ti\ 2 /n) = 0(t^/n) = o(ao). 

Since (S t — S , t -) t l 4 - is a martingale with final variance 0(t a ) = o(<Tq), Doob's 

maximal inequality gives sup| t _ (l | <to \St — Sti \ — °p( cr o): and (|6Tj) follows. 

Recall from Lcmma l5.1l that Z, the number of components explored by time 
to, satisfies Z = o p (ero)- We have shown above that whp T% — ird{t : X t — 
—2(Z + 1)} lies between t± and tf . From ([55|1. X t is within o p (ao) of X t at 
least until T\. It follows that at time T\, we have X t = o p (ao). Since a = 0(e), 
(|6T|) thus gives 

Ti =*i +Xt 1 /a + o p (a /e). (62) 

From Lemma T3.61 (|50|) and the fact that x(p) = 0, we have that X tl is asymp- 
totically normal with mean and variance vopn. Hence X tl /a is asymptotically 
normal with mean and variance 

vopn/a 2 ~ 2pie~ 1 n, 

using (O, (fTT|) and (|60|) . Since this variance is of order e _1 n = £~ 2 <Tq, the 
o P (uo/s) error term in (|62|) is irrelevant, and T\ is asymptotically normal with 
mean t\ — pn and variance 2pis~ 1 n. Finally, from Lemma 15.11 we have To = 
o p (co/s). It follows that T\ — To is asymptotically normal with the parameters 
claimed in the theorem. □ 

We are now ready to complete the proof of Theorem 11.11 

Proof of Theorem \1.1[ Let C denote the component explored from time To to 
T\. We have already shown that C has the size claimed; two tasks remain, 
namely to study the nullity of C, and to show that there are no other 'large' 
components. 

Recall from (|2T)1) that the conditional expected number of back-edges added 
at each step satisfies 

E(0 t+1 | Ft) = Hm+i - 1 I Ft)A t /U t + 0(l/n). (63) 



23 



For the nullity, we shall consider only t < min{Ti, i max }, recalling that whp 
T\ < t max = 0(en). In this range, we have Ct < Z + 1 < 2u/e whp by 
Lemma [5.11 Also \E t \ < LuVs 3 n whp by (|58[) . Since e — > and e 3 n — > oo, if 
cj — » oo sufficiently slowly then both these bounds are o( v / en). Recalling that 
X t =X t -E t =A t -2C t -E t it follows that whp 

|X t - A t | = o{y/m) (64) 

throughout our range. 

For t < i max , Lemma [3.41 gives x t = 0(e 2 n), and it follows easily from the 
bounds above that X t is whp 0(s 2 n). Recalling that /"(1)//'(1) = ~ 1, 

and defining z by f(z) + t/n = 1 as before, by Corollary 13.31 the maximum 
relative error (for t < t max ) in approximating U t by nzf'(z) or E(?7 t+ i — 1 | Ft) 
by zf"(z)/f(z) is Op(vWn) = o p (l/v / £3^). Using ((Ml) and (JMJ) it follows 
that whp 

D* +1 = E(8 t+1 I Ft) = + o(V^) (65) 

for all t < Ti. 

Recalling that a; tl = 0, the bounds in the proof of Lemma 15.21 show that 
whp \X t \ < cjcto for < t < Ti, and that whp Ti < t± + 2t . It follows 
from (|65|) that whp no more than, say, Lo 2 aoto/n — ^^/(en) — lo 3 — o(yJ e 3 n) 
back-edges are added between time t\ and time T±. A similar bound holds for 
steps up to to and for steps between Ti and t\ = pn. Let Y = Y pn be the total 
number of back-edges found up to time pn. Using ([32]), it follows that 

\Y-n(C)\ =o p (\/i%). (66) 

Let us write 6 t +i as T>* +1 + A^ +1 , so by definition E(A* +1 | F t ) = 0. Then 
y = TJ>* + S**, where TJ>* = J2t< P n D t and S** = Et< P „ A t- N «te that D* is 
random. Recalling that X t — Xt + St and that KSt = 0, it follows from (|65p and 
the definition of = x(t/n) (see (|43"10 that 

where, as usual, z — z(t) is defined by f(z) + t/n = 1. 

It is not hard to see that the sum above sufficiently well approximated by 
the corresponding integral. Recalling that with r = t/n we have ^ = —f'{z), 
it follows that 

E£* = n f (z- f'(z)/ f i 1 )f"(z)dz + Op(V^), 

where z% corresponds to r = p. In other words, z\ is the value of z defined in 
||HJl, which as noted in the proof of Lemma [3.41 satisfies z\ = f {z\)jp,\. The 
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integrand above is the derivative of zf'(z) — f(z) — f'{z) 2 / (2/xi). It follows that 
ED* = n - & - 1 + f^j + o p (V^hl) = p*n + Op(v^), (67) 

recalling HTDJ). Since EA* = 0, this gives EY = p*n + o p (v / e%). 

From Lemma [3. z j(v) | and (H2l it follows easily that S = 1 — Z\ ~ ~ 
2ep\/pz- We may write p* as /(l — <5) — 1 + ^i<5 — ^\8 2 /2. Expanding f(z) 
about 2=1, using /(l) = 1, /'(l) = /xj, f"(l) = M2 = (1 + /'"(l) = ^3 

and /( 4 ) = O(l), a little calculation establishes that 



2/i 3 
3 M 3 



is 3 . (68) 



We now turn to the variance and covariance estimates. Here we can be much 
less careful, as a o(l) relative error does not affect our conclusions. 

Recall that D* is random. From ([55]). (|6T|) and the fact that X t = EX t + S t 
we can write -D* as p*n + D' + o p (v / £ 3 ri), where 



S /"(*) 



n f'lz) 2 ' 

Thus 

Y = D* + S* = p*n + D' + S* + o p (v / i%). 

Throughout the relevant range, f"(z)/f'(z) 2 ~ /"(1)//'(1) 2 = /Ua/Mi ~ Vmi- 
Since St = ^i, ^ follows that 

pn — 1 

for some constants aj = aj(n) satisfying 

a, - (pn - i)/(pxn). 
Since the Aj are martingale differences with variances vq + o(1), it follows that 

pn— 1 pn— 1 , o „ o 

Var(D ) ~ ^ voa, ^7T n ~ ^7I £ n ' 

l=0 Mi j=0 " ^1 ^3 

using and (HI]). Hence D* = p*n + O p (Ve 3 n) = (1 + o p (l))p*n. 
Recalling that t +\ > 2 is much less likely that 8t+i — 1, we have 

Var(A* +1 I JT t ) = Var(0 t+1 | JF t ) ~ E(9 t+1 \ F t ). 

Summing, it follows that 

pn— 1 

, 2 = J2 Var(A* +1 I T t ) = (1 + o p (l))D* = (1 + 0p (l))p*n. 

t=o 
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In particular s 2 /(p*n) converges in probability to 1. 

Recall that given T t , A t+ i and A* +1 both have conditional expectation 0. 
Recalling (|28p . their conditional covariance, which is just that of n t +i — 1 and 
6 t +i, is asymptotically Vax(rj t+1 - 1 | F t )A t /U t ~ v A t /U t = 6(e 2 ). This 
is much smaller than the square root of the product of their variances. It 
follows easily that, after appropriate normalization, the joint distribution of 



S = S pn = J2t< P n A t and N> = D' + S* = £ t<p „(a t At + AJ ) is asymptotically 



Recalling that the nullity of C is Y + o p (v s 3 n) = p*n + N' + o p (\/e 3 n) and that 

\C\ = pn + S/a + o p (y/en) where o — —x(p) ~ e, it follows that \C\ and n(C) are 
jointly asymptotically normal with the means, variances and covariance claimed 
in Theorem ll.il 

It remains only to prove that all components other than C have size bounded 
by O p (e~ 2 log(e 3 n)) = o(en) as in (fT2"l) . Lemma I57T1 shows that T = O p (e~ 2 ), 
so whp by time T\ we have found no second component larger than this. 

Let us stop the exploration at time T\. Then the unexplored part of the 
graph is simply the configuration multigraph on the degree sequence d' given by 
the vertices not yet reached. Note that A(d') is exactly the expected value, given 
the history, of the degree ??Ti+i of the vertex about to be chosen. Since we have 
explored 0(en) = o(n) vertices, d' satisfies the assumption ([2]) (with a slightly 
reduced cq), and it is still bounded. Since ^/en/n = o(e), by Corollary 13.31 
E(r/Ti+i — 1 | Ti, Ftx) = x(p) + o p (e), so we find that A(d') = 1 — a + o p (e) 
with a as in (|60|) . Since a = 6(e), Theorem 11.21 thus tells us that the largest 
component remaining has size O p (e -2 log(e 3 n)), as required. □ 

Remark 5.3. Considering a random walk with independent increments with 
distribution r\ — 2, one would expect that the probability that our random walk 
(X t ) 'takes off' without hitting —2 near the start is roughly the expected degree 
of the initial vertex times 2e/vq. (To see this, simply solve the recurrence relation 
for the probability of hitting —2 as a function of the initial value.) The expected 
degree of the initial vertex (which is chosen with probability proportional to 
degree) is roughly 2, giving a 'take-off' probability of roughly Ae/v ~ 4 : p Jl //j, 3 ; 
it is not hard to check that this is asymptotically correct in the actual process 



One should expect this probability to be simply related to (or at first sight 
equal to) p; here the difference is that p is the probability that a uniformly 
chosen random vertex is in the giant component. It is not hard to check that 
the giant component is 'tree-like', and in particular has average degree 2 + o(l), 
so the probability that a vertex chosen with probability proportional to degree 



multi-variate normal, with 



Var(iV') - Vm(D') + Var(S'*) 




and 
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is in the giant component is around 2pjp\. From (|TTj) this is asymptotically 
4e/j,i/n3, as it should be. 

This comment illustrates a strange feature of the trajectory tracking argu- 
ments here and in the papers of Nachmias and Peres [53] and Bollobas and 
Riordan [5J: there is a related viewpoint using branching processes which more 
easily gives the approximate size of the giant component, essentially by con- 
sidering the probability that a vertex is in a large component, i.e., that the 
random trajectory 'takes off'. One can prove quite accurate bounds by this 
method without worrying about when the trajectory will hit zero eventually; 
see Bollobas and Riordan [5]. However, for the distributional result, it seems 
easier to follow the whole trajectory. 

6 A local limit theorem 

As a step towards the proof of Theorem ll.2l we shall need a local limit theorem 
(Lemma l6.3l below) that may perhaps be known, but that we have not managed 
to find in the literature. This concerns a sequence (£>„) of sums of independent 
random variables. As usual in the combinatorial setting, each S n involves differ- 
ent variables; we cannot make the more usual assumption in probability theory 
that each S n is the sum of the first n terms of a single sequence. This makes 
little difference to the proofs, however. 

We start from Esseen's inequality in the following form, also known as the 
Berry-Esseen Theorem; see, for example, Petrov [24j Ch. V, Theorem 3]. We 
write 4>{x) and <&(x) for the density and distribution functions of the standard 
normal random variable. 

Theorem 6.1. Let Z\, . . . , Z t be independent random variables such that p = 
X^i=i ^(l^il 3 ) < °°> an d & = 5Zj=i ^i- Then 

swp\F(S < x) - $(0 - n)/a)\ < Ap/a 3 , 

X 

where p and a 2 are the mean and variance of S, and A is an absolute constant. 
□ 

Given an integer- valued random variable Z, let b r (Z) denote the rth Bernoulli 
part of Z, defined by 

b r (Z) = 2supmin{P(Z = i),P(Z = i + r)}. (69) 

i 

It is easy to check that for any p < bi(Z) we can write Z in the form 

Z = Z' + IB, (70) 

where / ~ Bern(p), B ~ Bern(l/2), and B is independent of the pair (Z',I). 
Here Bern(p) denotes the Bernoulli distribution assigning probability p to the 
value 1 and probability 1 — p to the value 0. Similarly, for p < b r [Z) we can 
write Z in the form Z' + rIB with the same assumptions on Z 1 , I and B. 
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We should like a 'local limit theorem' giving, under mild conditions, an 
asymptotic formula for P(5 = say, where S is a sum of independent 

random variables and \i = ES. As is well known (see, e.g., [231 Ch. VII]), 
when the summands take integer values in a finite range, the only obstruction 
is their taking values in a non-trivial arithmetic progression, in which case the 
sum cannot take certain values. Results similar to the next lemma are stated 
in |24) . but the conditions are different in important ways. 

Lemma 6.2. Let k > 1 be fixed. Suppose that for each n we have a sequence 
{Znif^l of independent random variables taking values in {— k, —k + 1, . . . , k}. 
Let S n — Yll^i Zm, and let p n and cr 2 denote the mean and variance of S n . 
Suppose that a 2 ^ = 8(i(n)), that t(n) — » oo, and that 

t(n) 

h,n = 22h{Z ni ) oo. 
i=l 

Then for any sequence (x n ) of integers satisfying x n = [J, n + 0(a n ) we have 

P ( S „ = I J„ wW = ^e x p(-<-^). (71) 

Proof. The condition \Z ni \ < k ensures that E(|Z ni | 3 ) < fc 3 = 0(1), while by 
assumption a 2 — 0(i(n)), so Theorem 16.11 gives 

sup|P(S„ <x)- $((x - Hn)/a n )\ = 0(t(n)-^ 2 ). 

X 

We shall not use exactly this bound, instead applying Theorem 16. II to a slightly 
different sum of independent variables. 

Let lo = uj(n) be an integer chosen so that cj — > oo, but lj 6 < b\ >n and 
cu 2i < t(ri), say. Choose pi — p n i so that < pi < b\(Z n i) and Y^Pi = uj6 - 

Suppressing the dependence on n in the notation, let us write Zi ~ Z n i in 
the form Zi = Z[ + IiBi as in (|70[) . where Ii ~ Bern(pi), Bi ~ Bern(l/2), Bi is 
independent of (Z-,ij), and variables associated to different i are independent. 
Let I = (Ji, . . . , I t ), and let N = |I| = £ J,. 

The idea is simply to condition on I, and thus on N. Let S° = S® = Z' i: 
and S 1 = Si = E lM =i B i> so 5„ = S° + S 1 . Then, given I, S*° and S 1 
are independent. Furthermore, the conditional distribution of S* 1 is binomial 
Bi(7V,l/2). 

In the following argument we shall consider values of iV in three separate 
ranges: N > t 1/3 , N < w 6 /2, and the 'typical' range uj 6 /2 <N< t 1 / 3 . 

Since N is a sum of independent indicators with KN — oj 6 < i 1 / 4 , standard 
results (e.g., the Chernoff bounds) imply that ¥(N > t 1 ^ 3 ) is exponentially small 
in i 1 / 3 , and hence, extremely crudely, that ¥(N > t 1 / 3 ) — o(t~ x / 2 ). Thus 

V(S„ = x n A N > t 1 ' 3 ) = air 1 ' 2 ). (72) 
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Note that E5° = fi n - EiV/2 = fi n - uj 6 /2 = fi n + o(s/i). Similarly, Var 5° = 
VarSVi + o{-\/l) = a 2 + o(V?). Let fx and a 2 denote the conditional mean and 
variance of S° given I. Given I, the summands Z[ in S° are independent, but 
their individual distributions depend on the Changing one Ii only affects 
the distribution of one summand, and all Z[ are bounded by ±fc, so we see that 
jl and a 2 change by 0(1 ) if one entry of I is changed. It follows easily that 
jl = E5° + 0(N + EN) and a 2 = Var S" + 0(N + EN). Hence, when N < t 1 / 3 
we have 

pj = fi n + o(t 1 / 2 ) and a ~ a n . (73) 

Given I, S is a sum of t independent random variables whose third moments 
are all bounded by fc 3 . By Theorem 16. H it follows that when N < t 1 ^ 3 then 

P(S° < x 1 1) = ${{x - p)/a) + 0(t~ 1/2 ) (74) 

uniformly in x. Considering consecutive (integer) values of x, it follows that 

supP(S ,0 = x|I)=O(t- 1 / 2 ) (75) 

X 

whenever N = |I| < t 1 / 3 . 

To handle the case N < w 6 /2, recall that after conditioning on I, the sums 
5° and S 1 are independent. Thus 

F(S n = x n | N < w 6 /2) < sup supP(S' = x n - m | I = I , 5 1 = m) 

I :|I |<^ 6 /2 m 

= or 1 / 2 ), 

using (175)) for the final bound. Since F(N < w 6 /2) = o(l), this gives 

P(5„ = x n A N < lu 6 /2) = air 1 ' 2 ). (76) 

Finally, consider the 'typical' case, where w 6 /2 < N < t 1 / 3 . Condition 
on I, assuming that N is in this range. Let / be an 'interval' consisting of w 
consecutive integers. By (|74|) we have 

P(S° e 1 1 I) = $(y + w/er) - + 0(t~ 1/2 ), 

where y = (mini — jj)/<r. If the endpoints of / are within o(y/i) of x n = 
(Jin + 0(a n ) then using (|73|) we have y ~ {x n — ^i n ) / a n , and it follows easily that 

P(S° e / | I) = wp + 0(* _1/2 ) ~ wpo, (77) 

where po is as in (I71|) . and we used po = 6(£ -1 / 2 ) and w — > oo in the final 
approximation. 

For a = 0, 1, 2, ... , let J a be the interval [aid, (a + l)u> — 1], and let I a = 
x n — Ja- Recalling that S° and S 1 are conditionally independent, we have 

N/u+l 

P(5„ = x n \T)> V F(S° G J 1 1) minPCfiT 1 = j | I), 
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and a corresponding upper bound with min replaced by max. Since N < t 1 / 3 — 
o(y/t), from (l77l) we have P(S* € I a | I) ~ wpo for all a < N/u>+ 1, so we obtain 

F(S n = x n | I) > (1 + o(l))wpo V min P(5 X = j | I), 

z — ' jeJa 

a 

and a corresponding upper bound with min replaced by max. Recall that, 
conditional on I, the distribution of S 1 is simply binomial Bi(iV, 1/2). Since the 
standard deviation \J~N /2 of S 1 is much larger than w, elementary properties of 
the binomial distribution imply that 

^minP(S' 1 =j | I) ~ ^maxP(S' 1 = j | I) ~ 1/w. 

a ^ a a ^ a 

(The bulk of each sum comes near the middle of the binomial distribution, where 
the point probabilities hardly change within one interval; all that is actually 
needed here is u) = o(v^V)-) This gives us an estimate for F(S n = x n \ I) valid 
whenever ui 6 /2 < N < t 1 / 3, , and it follows that 

F(S n = x n A lu 6 /2 < N < t 1 / 3 ) - p P(w 6 /2 < N < t 1 ' 3 ) - p . (78) 

Combining {71]), dZHJ) and (JTHJ) gives the result. □ 

Results somewhat similar to Lemma 16.21 are certainly known; see, for ex- 
ample, McDonald [19], where Bernoulli parts are used to deduce a local limit 
theorem from a central limit theorem. However, the assumptions are different, 
and the Bernoulli part needed is much larger. 

Note that uniform boundedness is not really needed in Lemma 16.2) a suit- 
able condition on the third moments should suffice. Also, we may replace the 
condition J2i bi{Z n i) — > oo by J2i b r (Z n i) —> oo for all r € R, where R is any 
set of integers with highest common factor 1. This latter condition is 'almost' 
necessary (after passing to a subsequence): without it there is some d (the high- 
est common factor of the integers in R) such that even distribution modulo d 
will only happen because of a 'coincidence'; see the discussion in [24] Ch. VII]. 

We shall need a result along the lines of Theorem 16.21 but away from the 
central part of the distribution. This follows easily using a trick called 'expo- 
nential tilting', introduced by Cramer [5], and suggested to us by Paul Balister. 
Let Z be a random variable, here with finite support, such that F(Z > 0) and 
F(Z < 0) are both positive. Consider the function 

f(a) = E(Ze aZ ) = Y,Px xeaX > 

X 

where p x = F(Z = x). Note that 

f'(a) = E{Z 2 e aZ ) > 0. 

Also, if the support of Z is contained in [— k,k], then |/"(a)| = |E(Z 3 e aZ )| < 
fc 3 e M fe . 
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Since f(a) is increasing and tends to ±00 as a — > ±00, there is a unique 
a = a(Z) such that /(a) = 0. Define 

c = c(Z)=E(e aZ ) = J2p*e ax , (79) 

X 

and let Z' be the random variable with 

F(Z' =x)= P(Z = x)e ax /c, 

noting that these probabilities sum to 1 by the definition of c, and that K(Z') = 
by the definition of a. It is easy to check that if St denotes the sum of t 
independent copies of Z and S' t the sum of t independent copies of Z' , then 

¥(S' t =x)=¥(S t =x)e ax /c t . (80) 

Recall that b\(Z) is the Bernoulli part of Z , defined by ([ST))) . 

Lemma 6.3. Let F be a finite set of integers, and let Z n , n > 1, be a sequence 
of probability distributions supported on F, with liminfP(Z„ < 0) > and 
liminf ¥(Z n > 0) > 0. Suppose that t — t{n) — > 00, and that tb\{Z n ) — > 00. 
Let S n denote the sum oft independent copies of Z n , and define a n — a(Z n ), 
c n = c{Z n ) and Z' n as above. Then 

P(5 n - x) ~ . 1 e-^ci 
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uniformly in integer x — o(y/t), where a\ is the variance of Z' n . 

Proof. In the light of ([80]), it suffices to prove that P(S' n = x) ~ l/(a n V2nt), 
where S' n is the sum of t independent copies of Z' n . 

Passing to a subsequence, we may suppose that ¥(Z n = i) converges for 
each i, and that there are i < and j > for which the limit is strictly 
positive. It follows that the 'tilting amounts' a — a(Z n ) are bounded. Hence 
bi{Z' n ) — @(bi(Z n )), so tbi{Z' n ) — > 00. Also, Var(Z^) is bounded below by some 
positive number. Lemma 16.21 thus applies to the sum of t independent copies of 
Z' n , giving the result. □ 

We finish this section by noting some basic properties of tilting applied to 
random variables whose mean is close to zero. 

Lemma 6.4. Let k be fixed. If Z n is a sequence of distributions on {— k, . . . , k} 
with e n — EZ„ — > and Var(Z„) = <r„ = 6(1), then the quantities a n = a(Z n ) 
and c n = c{Z n ) defined above satisfy 

a n s n /al (81) 

and 

l-c n ~e 2 J(2a 2 n ). (82) 

Furthermore, V&r(Z' n ) ~ a 2 t , and if W n is supported on {— k, ...,k} and may 
be coupled to agree with Z n with probability 1 — p n where p n = o(e n ), then 

\a(W n ) - a n \ = 0( Pn ) and \c(W n ) - c„| = 0(e nPn ). (83) 
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Proof. Suppressing the dependence on n, let f{a) = E(Ze aZ ) as above. Then 
/(0) = EZ = e, and /'(0) = EZ 2 = cr 2 + e 2 ~ cr 2 . Also, /"(a) is uniformly 
bounded for a £ [—1,1], say. It follows easily that a n , the solution to f(a) — 
0, satisfies (ED). Similarly let = E(e aZ ). Then 3 (0) = 1, g'(0) = e, 

g "(0) = /'(0) - cr 2 and g"'(a) is bounded for a = 0(1). It follows that 
Cn = g(a n ) = 1 + a n e + a 2 n a 2 /2 + 0(e 3 ), giving (|g2J. 

To see that Var(Z^) ~ cr 2 it is enough to note that a n — > 0. The final part 
may be proved using the fact that for each fixed j and all a G [—1,1], we have 
\E(Z^e- aZ -)-E(W^e- aW -)\ =0(p n ). □ 

7 The subcritical case 

In this section we prove Theorem 11.21 Although we do use the exploration 
process considered in the rest of the paper, we do not track the deviations of this 
process from its expectation; instead we use stochastic domination arguments 
to 'sandwich' the process between two processes with independent increments. 

We start with a lemma concerning the tail of the distribution of the time that 
a certain random walk with independent increments takes to first hit a given 
value. In the application we shall essentially take Z n to be the distribution of 
?7 — 2, where r\ is the degree of a vertex of our graph chosen with probability 
proportional to its degree (see Section[2]). In fact, we shall adjust the distribution 
slightly both to allow us to use stochastic domination, and to meet the condition 
tb±(Z n ) — > 00. In what follows we often suppress dependence on n in the 
notation. Recall that the Bernoulli part b\{Z) of a distribution Z is defined by 

PH. 

Lemma 7.1. Let k > 1 be fixed, and let Z n be a sequence of probability distribu- 
tions on {—1, 0,1, ... ,k} converging in distribution to some Z with Var(Z) > 0, 
such that e = e(n) = — EZ n > and e — > 0. Let W„ = (Wt)t>o be a random 
walk with Wq — and the increments independent with distribution Z n , and let 
r r = T r (n) = inf{i : Wt = —t}. Suppose that r > 1 is fixed, and that t — t(n) is 
such that e 2 t — > 00 and tbi{Z n ) — > 00. Then 

P(r r > t) ~ Cr iZ 5-H- 3/2 e- 5t , (84) 

where 5 — 5 n = — log(c(Z„)) with c(Z n ) defined as in (I79p . and c r .z > is 
some constant depending on r and Z . 

Proof. We start with the case r = 1. Here Spitzer's Lemma [27] gives P(ti = 
t) = F(Wt — — l)/i; indeed, given a sequence of possible values x\,. . . ,x t of 
the first t increments summing to — 1 , there is exactly one cyclic permutation of 
(x\, . . . , xt) such that the walk with the permuted increments stays non-negative 
up to step t. 

Lemma T6.4I gives a n — > and c n = 1 — 0(e 2 ). Thus 6 — 9(e 2 ) and, writing 
a 2 for the variance of Z' n , a ~ a. Lemma 16.31 thus gives 

F(W t = -1) ^e~ 5t , 
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whenever t — > oo with tb\(Z n ) — > oo. Hence 



( n =t)~t- 3 / 2 -±= e-*\ (85) 
ay 2ir 



When St —¥ oo, summing over i easily gives 



P(n > t) ~ j- 1 ^ 3 / 2 — =e -w . (86) 

crv27r 



Indeed, the sum is dominated by the first O^^ 1 ) = 0(e~ 2 ) terms, and in this 
range i -3 / 2 hardly changes. 

For general r we simply note that r r is distributed as the sum of r indepen- 
dent copies , . . . , of t±. Since X)t>i t~ 3 ^ 2 converges, using ([55)1 it is easy 
to see that the dominant contribution to P(r r > t) comes from the case that one 
of the is large and the others are 0(1). Convergence in distribution of Z n 
implies that for each j P(n — j) converges to some limit, so follows from 
©. □ 



We are now ready to prove Theorem 11.21 

Proof of Theorem ] 1. 6 A Let Z = Z n denote the distribution of rj — 2, recalling 
that r\ is the degree of a vertex chosen with probability proportional to its degree. 
Passing to a subsequence, we may assume that Z n converges in distribution to 
some distribution Z* . Nonetheless, in what follows we must work with the 
actual distribution Z n rather than the limit, since the bounds are sensitive to 
small changes in the distribution of Z n . 

Note that Z = Z n is supported on {—1, 0, . . . , c£ max — 2}, and that by ([7]) 
we have Var(Z) = 0(1). Also, EZ = — e, where by assumption e — > and 
e 3 n — > oo. Let S = 8(Z) = — \og(c(Z)) be defined as above, noting that 

5r .|l = 9( £ 2 ) (87) 
2v Q 

by Lemma 16.41 Note also that S is exactly the quantity S n appearing in the 
statement of Theorem 11.21 

Let A = e 3 n, recalling that A — > oo by assumption, and set 

5 

t + = (5 _1 (logA- -loglogA + w) 

for some w — > oo with u = o(loglog A). 

Let Z' be defined in the same way as Z : except that we first remove the 
2<i max < + = o(n) non-isolated vertices of lowest degree from our degree sequence. 
Then, conditional on the first t < 2t + steps of our process, using (f22j) the 
distribution of the next increment X t +\ — X t = rj t +i — 2 — 26 t +i < rj t +\ — 2 is 
stochastically dominated by Z 1 . Let 7 = £ 3 / 2 /logA, say, noting that 7 — > 0, 
jt + ~ e -1 / 2 — > 00, and ejt + — > 0. Define Z + by modifying the distribution of 
Z' as follows: Pick some k such that F(Z' — k) > 27, and shift mass 7 from 
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k to k + 1. Note that bi{Z + ) > 7, so t+6i(Z + ) -> 00. Also Z+ stochastically 
dominates Z' , and Z + and Z may be coupled to agree with probability 1 — p, 
where 

p = 0{t+/n + j). 

Note that e~ 1 t + /n = 0((log A)/A) = o(l), and clearly 7 = o(e), so p = o(e). 

Considering the random walk with independent increments distributed as 
Z + , writing C\ for the first component revealed by our exploration, stochastic 
domination and Lemma 1 7. II give 

P(|Ci| > t+) = P(inf{t : X t = -2} > t+) < (1 + o(l))c($ + )- 1 (* + )- 3 / 9 e- tf+t+ , 

where S + is defined as 6, but using Z + in place of Z, and c = C2,z* is a positive 
constant. 

Lemma l6~4l gives 5 + ~ <$, and indeed |<5 + — 5\ = O(ep). Thus |<5 + t + — 5t + \ = 
0(ept + ), which is easily seen to be o(l). Thus the bound above can be written 
more simply as 

P(|Ci|><+) < {l + o{l))c8- l {t + )- z ' 2 e- 5t+ 

= c— n<5- 1 (t+)- 5 / 2 A- 1 (logA) 5 / 2 e- a; 
n 

~ c-(2« )- 3/2 e-" 
n 

= — e(e- w ) =o(t+/n). 
n 

If our graph GJJ 1 contains a component of order at least t + , then the probability 
that we explore this component first is at least (2t + — 2)/(d max n) = <d(t + /n). 
It follows that P(Li > t + ) = o(l), proving the upper bound in (|17|) . 

Turning to the lower bound, we use stochastic domination in the other di- 
rection. This time we must account for back-edges. At a given step t < 2t + , the 
(conditional, given the history) probability of forming a back-edge is 0(t + /n), 
simply because there can only be 0{t + ) active stubs. It follows that we can 
define a distribution Z~ that may be coupled to agree with Z with probability 
1 — 0(t + jn + 7) so that the conditional distribution of X t +i — X t stochastically 
dominates Z~ whenever t < 2t + . Setting 

r =8- 1 (logA- ^ log log A -w), (88) 

the argument above adapts easily to prove that 

F(|d| >t-)~c— (2v )-V 2 e". 
n 

Let I = [t~,t+] and write t = (t~ + 1+)/2, say. Noting that t~ ~ t + ~ t, the 
bounds above combine to give P(|Ci| € I) ~ e'e^t/n, where c' = c(2vo)~ 3 ^ 2 ■ 
Let N denote the number of components C with \C\ £ I. It is easy to check 
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that with high probability no such component will have significantly more than 
\C\ edges. Since our initial vertex is chosen with probability proportional to 
its degree, it follows that P(|Ci| G J) ~ (EA^)2i / '(//in) , where [i\ is the overall 
average degree. Hence 

EN - -S- oo. 

Finally, with C2 the second component explored by our process, we have 

P(|&|, |C 2 | € I) ~ E(N(N - l))(2i/( Ml n)) 2 . (89) 

The estimates above apply just as well to bound P(|C2| € / | \C\\ € /): through- 
out, we only needed that at most 2t + vertices had been 'used up'. We find that 
the left-hand side in is asymptotically (c'e u t/n) 2 , so it follows that 

E(N(N - 1)) ~ (cVie"/2) 2 ~ {EN) 2 . 

Since EiV — ► 00, this gives E(A^ 2 ) ~ (EA^) 2 , so Chebyshev's inequality implies 
that P(N > 0) -> 1, completing the proof of (fT7|t . 

The argument for (|15|) is essentially the same; we simply replace —lo by +x in 
the definition (|88|l of t~ . With AT the number of components with order between 
this new t~ and t + it follows as above that EA~ ~ a = d ' [i\eT x /2. Moreover, 
arguing as for AT (AT — 1) above, for each fixed r the rth factorial moment of A~ 
converges to a r . It follows by standard results that A" converges in distribution 
to a Poisson distribution with mean a, so P(A~ = 0) — > e~ a . Note that the 
constant c in (fT5|) is 

c = c'fix/2 = c 2 ^(2v Q y 3 / 2 t i 1 /2 ~ c 2 , z .2- 5 / 2 ^ /2 M - 3/2 , (90) 

with C2,z* as in Lemma |7. II □ 
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